Towards Universal Multilingual Knowledge Bases

نویسندگان

  • Gerard de Melo
  • Gerhard Weikum
چکیده

Lexical, ontological, as well as encyclopedic knowledge is increasingly being encoded in machine-readable form. This paper deals with knowledge representation in multilingual settings. It begins by proposing a generic graph-based knowledge base framework, and then, in three case studies, explains how preexisting knowledge can be cast into this framework. The first case study involves enriching WordNet with information about human languages and their relationships. The second study shows how machine learning techniques can be used to bootstrap a large-scale multilingual version of WordNet where semantic relationships between terms in many languages are captured. The final study examines how information can be extracted from Wiktionary to produce a lexical network of etymological and derivational relationships between words.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ULiS: An Expert System on Linguistics to Support Multilingual Management of Interlingual Knowledge bases

We are interested in bridging the world of natural language and the world of the semantic web in particular to support multilingual access to the web of data, and multilingual management of interlingual knowledge bases. In this paper we introduce the ULiS project, that aims at designing a pivot-based NLP technique called Universal Linguistic System, 100% using the semantic web formalisms, and b...

متن کامل

Multilingual Evaluation of KnowNet Evaluación Multiĺıngüe de KnowNet

This paper presents a new fully automatic method for building highly dense and accurate knowledge bases from existing semantic resources. Basically, the method uses a wide-coverage and accurate knowledge-based Word Sense Disambiguation algorithm to assign the most appropriate senses to large sets of topically related words acquired from the web. KnowNet, the resulting knowledge-base which conne...

متن کامل

Generation of Multilingual Personalized Environmental Bulletins from an OWL-based Ontology

Natural Language Generation (NLG) from knowledge bases (KBs) has repeatedly been subject of research. However, most proposals tend to have in common that they start from KBs of limited size that either already contain linguistically-oriented knowledge structures or to whose structures different ways of realization are explicitly assigned. To avoid these limitations, we propose a three layer OWL...

متن کامل

Exploiting Knowledge Bases for Multilingual and Cross-lingual Semantic Annotation and Search

The amount of entities in large knowledge bases (KBs) has been increasing rapidly, making it possible to propose new ways of intelligent information access. In addition, there is an impending need for systems that can enable multilingual and cross-lingual information access. In this work, we firstly demonstrate X-LiSA, an infrastructure for multilingual and cross-lingual semantic annotation, wh...

متن کامل

Automatic Generation of Multilingual Lexicon by Using Wordnet

A lexicon is the heart of any language processing system. Accurate words with grammatical and semantic attributes are essential or highly desirable for any applicationbe it machine translation, information extraction, various forms of tagging or text mining. However, good quality lexicons are difficult to construct requiring enormous amount of time and manpower. In this paper, we present a meth...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009